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Abstract. The objective of the paper is to identify and investigate ah possible 
types of asymptotic behavior for the maximum likehhood estimators of the unknown 
parameters in the second-order hnear stochastic ordinary differential equation driven 
' by Gaussian white noise. The emphasis is on the non-ergodic case, when the roots of 

. , the corresponding characteristic equation are not both in the left half-plane. 

=3 



X 



1. Introduction 



E"' I Consider the stochastic ordinary differential equation 

^ (1.1) x{t) = d^x{t) + e^xit) + aW{t), t > 0, 

I with a standard Brownian motion W = W(t), non-random initial conditions X{0), 

X[0), and two real parameters di, 62. Equation (11.11) is often referred to as a continuous 
time auto-regression of second order, or CAR(2), being a particular case of CAR(N) 

C^: (1.2) X^^^ = J20N-kX^'^ + W; 

i> : fe=o 

see [1]. A rigorous interpretation of equation (11.11) is the system 

(1.3) dX = Xdt, dX = {e2X + eiX)dt + adW{t). 

^ ! In the matrix-vector form, system (11.31) becomes a particular case of the multi-dimen- 

sional Ornstein-Uhlenbeck process studied in |2j: 

(1.4) dX{t) = QX{t)dt + (TdW{t), 



H ■ with 



The estimator studied in [2j is 

^ {dX{t)X^{t)) \ l^j^ X{t)X^{t)dt\ . 

It is strongly consistent as T — 00, and this implies strong consistency of the maximum 
likelihood estimators for 61 and 62 in (11.11) for all {01,62) G M?; see ^ or Theorem 13.11 
below. 

When the process X defined by (ll.4p is ergodic (equivalently, when all eigenvalues 
of the matrix B are in the left half-plane), it is known [H Theorem 4.6.2] that the 
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estimator Gr is asymptotically normal with rate T^/^. Some non-ergodic models of the 
type fll.4l) have also been studied (see, for example, [IS])- The key object in the analysis 
of a maximum likelihood estimator (MLE) is the normalized log-likelihood ratio. A 
desirable property of this normalized log-likelihood ratio is local asymptotic normality 
(LAN), as it implies certain efficiency of the MLE; see [7, Chapter II]. While the LAN 
property is typically associated with ergodic models, it can also hold in some non- 
ergodic models [9j. Still, in many non-ergodic models, LAN is replaced with LAMN 
(local asymptotic mixed normality), leading to a different kind of efficiency of the MLE; 
see [121 Chapter 5]. 

So far, analysis of fll.4p in general and f ll.ip in particular was aimed at identifying the 
particular cases that fit in either LAN or LAMN framework. It turns out that many 
of the non-ergodic regimes of f 1 1.11) lead to new asymptotic forms of the normalized 
log-likelihood ratio and to new types of asymptotic behavior of the MLE. In the cur- 
rent paper, we present a complete asymptotic analysis of the MLE and the likelihood 
ratio for all possible values of the unknown parameters {61,62) G M^. Beside purely 
theoretical interest, this analysis can help in the investigation of the corresponding 
discrete-time models (see [4J in the ergodic case). 

The rest of the paper is organized as follows. Section [3] states the main results 
and discusses the results in the broader context of statistical estimation. After some 
preliminary work in Section IH the proofs are in Section O followed by a brief summary 
in Section O 

We fix a stochastic basis (fi, J^, {J^t}t>o,^) with a standard Brownian motion W = 
W{t). Other common notations are E for the expectation with respect to P, a/— 1 for 
the imaginary unit, ^ for the transpose of a vector or a matrix, g{t), g{t) for the first 

and second time derivatives of the function g, = for equality in distribution of random 

variables, = for equality in law of random processes, and Oa.s.(7') to denote a process 
converging to zero with probability one as T — )■ 00. 



2. Summary of the main results 
To write the likelihood ratio for equation (11. ip . define the vectors 

Then equation (11. ip becomes 

(2.2) dX = 6^Xdt + adW{t). 

Since X{t) = X(0) + J^X{s)ds, it follows from (D that 

(1) X = X{t) is a diffusion-type process in the sense of Liptser and Shiryaev ( [T^ 
Definition 4.2.7]) and generates the measure on the space of continuous 
functions on [0,T], 
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(2) For every eM."^, the measure is absolutely continuous with respect to the 
measure P° generated by the process X{0) + aWit), <t <T, and 

(2.3) / T T 

= exp(^^^ {92X{t) + 9iX{t))dX{t)-^ i92X{t) + 9iX{t)fdt 

see [ini Theorem 7.6]. 

The expressions for the maximum likelihood estimators 9i t of 9i and 6'2,t of 92, using 
continuous-time observations of both X{t), and X{t), < t < T, now follow from 

JoX\t)dtJ^X{t)dX{t) - J^X{t)X{t)dtj;^Xit)dX{t) 

j^x^{t)dtj^x^{t)dt-[j;^x{t)x{t)dty 
^^■^^ ^ ^ /o^x^(t)rft/;x(t)dx(t) - /;x(t)x(t)dt/;x(t )cix(t) 

/;X2(t)rft/;X2(t)rft - (/;x(t)x(t)rft' ' 
Similar to (12. ip we write 

(2.5) Ot = 




The amount of integration in (12.41) can be reduced using rules of the usual and sto- 
chastic calculus and keeping in mind that the processes X is continuously differentiable 
and the process X is a continuous semi-martingale with quadratic variation equal to 

X{t)X{t)dt = / X{t)dX{t) = ^, 



^0 

T 



(2.6) / X{t)dX{t) 



X\T) - a^T - X\0) 



^ 

[ X{t)dX{t) = X{T)X{T) - X(0)X(0) - I X\t)dt. 
Jo Jo 

With the continuous time observations, the value of a can be assumed known because 
the quadratic variation process of X at time t, an observable quantity, is aH. Note 
also that a does not appear in the formulas (12. 4p . 

The measure P° in (12. 3 p corresponds to the solution of (12. 2 p with 9i = 92 = 0. 
Sometimes it is more convenient to work with the reference measure coming from some 
other solution of (II. ip . for example, the actual observations. Accordingly, let us fix 
e and the corresponding observation process X = X{t), < t < T, satisfying 
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(foil . Then, for every ^ eR^, 

i^^W = ^/ i^'' x{t) - e'' x{t))dx{t) 



(2.7) 



see [ini Theorem 7.19]. With and X fixed, we write 



M^) = ln^(X). 

Define the matrix 

/ J^X\t)dt J^X{t)X{t)dt 

(2.8) ^T= / X{t)X^{t)dt 



/;x(t)x(t)dt /;x2(t)rft 



Then and ([22D imply 



Lt{^) = - [ {^-eyx{t)dw{t)-^! {^-ey x{t)x^{t){'d-e)dt 

(2.9) dyX{t)dW{t) - - ^)^^(T) - 0). 

We address the following questions about the estimators 9i{T),62{T): 

(1) rate of convergence, that is, finding positive deterministic functions Vi{T), 
i = 1,2, such that, as T — )■ cxd, Vi{T) +oo and Vi{T)(6i^T — Gi) converge 
in distribution to non-degenerate random variables, and identifying the corre- 
sponding hmit distributions; 

(2) existence of a normal limit with a random rate (NLRR), that is, finding 
a random matrix R = Rt such that, as T ^ oo, Rt{Ot — 0) converges in 
distribution to a Gaussian random vector. The hope is that at least one of 
the two things happens: (a) the random rate leads to a normal limit when a 
deterministic rate does not; (b) neither the matrix Rt nor the parameters of 
the limit distribution depend explicitly on the 61,62- 

(3) local asymptotic structure of the normalized log-likelihood ratio, that is, 
analyzing 

(2.10) iriu) = Lrie + Atu), 

as T — )• 00, where is fixed. At G R^^^ is a suitably chosen deterministic 
matrix with limT_j.oo |^t| = (any matrix norm will work), and n G M^. 
Let us recall some definitions related to the normalized log- likelihood ratio fl2.10p . 
It follows from I^M> that 

(2.11) iM = - [ (u^ATX(t)]dW(t)-^u^Al^TATU. 

cr Jo \ / 2a^ 

With a suitable choice of the matrix Ajn, there exists a non-trivial limit in distribution 

(2.12) i^(u) = lim iriu). 

T->-oo 
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Three particular cases of It satisfying f l2.12p are of special interest: 

• Local Asymptotic Normality (LAN), if there exists a bivariate normal vector 
^ with mean zero and non-degenerate covariance matrix such that, for every 

u e 

(2.13) ^^{u) = -n^l - -i-n^S^^x. 

• Local Asymptotic Mixed Normality (LAMN) if there exist a bivariate nor- 
mal vector r] with zero mean and unit covariance matrix, and a random sym- 
metric positive definite matrix B G M^^^ such that B and rj are independent 
and, for every u G ffi^, 

(2.14) i^{u) = -u'B^'^'n - ^u'Bu. 

If fl2.14p holds with a degenerate matrix S, we refer to It as DLAMN (degen- 
erate locally asymptotically mixed normal). 

• Local Asymptotic Brownian Functional structure (LABF) if 

(2.15) ioo{u) = - f G{t)dw{t) ~ ^ I u^G{t)G^{t)udt, 

Jo 2(T^ Jo 

where G G M^^^ is an adapted process, G is a standard Brownian motion, 
and the pair (G, w) is a Gaussian process. 

The definitions of LAN, LAMN, and LABF extend to more general likelihood ratios 
and to any finite number of unknown parameters. For details see [71 Chapter II] (LAN), 
P Chapter 1] (LAMN), and ^ Section 2] (unified approach to LAN, LAMN, and 
LABF). 

The LAN and LAMN properties of ix imply certain asymptotic efficiency of the 
corresponding maximum likelihood estimator (MLE): 

(1) If £t is LAN, then the corresponding MLE is asymptotically efficient in the 
sense of achieving the lower bound in the Cramer-Rao inequality; for details 
see [3, Theorem II. 12.1]. 

(2) If is LAMN, then the corresponding MLE has the maximal concentration 
property; for details, see [U Theorem 2.2.1]. Note that the result requires 
non-degeneracy of the matrix B and therefore does not immediately extend to 
DLAMN. 

In the LABF case, there are results about asymptotic efficiency of Bayessian estimators 
[TOj Section 3, Proposition 10] and sequential estimators [6^ Theorem 2]. 

To put our results in perspective, let us recall the estimation problem of the drift 6 
in the CAR(l) model, which is the one-dimensional OU process Y = Y{t) defined by 

(2.16) dY{t) = eY{t)dt + adW{t). 
Here is a summary of the results. For details, see [5l [11] . 

• The maximum likelihood estimator 9t of 9 using the observations of Y{t), < 
t < T is 

(2.1T) ir ^ llX^; 
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the estimator is strongly consistent as T — )■ cxd: limr-i^oo Ot = with probability 
one for all ^ G M. 

If 6^ < (asymptotically stable or ergodic case), then 



(2.18) ^lim ^/\e[f{eT -9) = V2 \9\ ^, 

where ^ is a standard normal random variable. 

• If 6' = (neutrally stable case), then 

(2.19) hm T(9t - 0) = "^'^^^-^ , 

where w = w{s), < s < 1, is a standard Brownian motion. 

• If 6' > (unstable or explosive case), then 

(2.20) lim e^^(^T - 9) = 26 



T-s-oo ^ + C 

where ^ = / ~ e-^'dW{t) is a standard normal random variable, 77 is a 

standard normal random variable independent of ^, and c = \/26Y{0)/(r. In 
particular, if 1^(0) = 0, then the limit has the Cauchy distribution and does 
not depend on a. 

• If ^ ^ 0, then NLRR holds: 

/ .T X 1/2 

(2.21) W ^^W^M {OT-d)=(TV, 

where r/ is a standard normal random variable. 

• The normalized log-likelihood ratio 

j;^Yit)dWit) IoY\t)dt 



is LAN if ^ < 0, LABF if ^ = 0, and LAMN if ^ > 0. 

Note that (a) the limit distributions in fl2.18p . f l2.19p . and f l2.2ip do not depend on the 
initial condition; (b) equality f l2.2ip illustrates the attractive features of NLRR: the 
rate 

r-T \ 1/2 



R{T) = (^J^ Y\t)dt^ 



does not explicitly depend on 9 and the limit distribution does not depend on 9 or the 
initial conditions. 

Table 1 summarizes the results, where Fd(w) denotes a generic functional of the 
standard (i- dimensional Brownian motion and Ch denotes a Cauchy-type distribution 
(ratio of two independent normal random variables). 



Let us now turn to equation (11. ip . Asymptotic behavior of estimators (12.40 depends 
on the roots p, q of the characteristic equation 

(2.22) - 9ir -92 = 0. 
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Table 1. Estimation in CAR(l) 

Parameter Rate LD ir NLRR 

^ < Vt Af LAN Yes 

^ = T Fi{w) LABF No 

^ > e^^ Ch LAMN Yes 

There are nine cases to consider: 

(1) The asymptotically stable (ergodic) case, when 9i < and 02 < 0. All in 
all, there are three possibilities for the roots: q < p < 0, g = p < 0, or 
p = X + v^— 1 u, q = X — V— 1 z/, with A < 0, but the asymptotic behavior of 
the estimators is the same in all three cases. 

(2) Six non-ergodic cases with real p,q: q<p = 0,q<0<p,p>q = 0,p>q>0, 
p = q > 0, or p = q = 0. 

(3) The harmonic oscillator, when p = y/ —1 u, q = —yZ—lu, u > 0; 

(4) Unstable oscillations, when p = X + \/^-Vv^ q = X — z/, and A > 0, z/ > 0. 

Table 2 summarizes the results for CAR(2). The detailed statements are in Section 
[3l In Table 2, we use the same notations as in Table 1. In particular, Fd{w) denotes 
a functional of the standard o?- dimensional Brownian motion and Ch is a Cauchy-type 
distribution. 



Table 2. Estimation in CAR(2) 



Case 




V2 


LDi 


LD2 


NLRR 




^1 < 0, ^2 < 


Vt 


Vt 


AT 


AT 


Yes 


LAN 


q < < p 


Vt 


Vt 


M 


Af 


Yes 


DLAMN 


< q < p 






Ch 


Ch 


Yes 


DLAMN 


< q = p 






Ch 


Ch 


Yes 


DLAMN 


q < p = 


Vt 


T 


M 


Fi{w) 


Yes (^i,t) 


LABF/LAN 


q = < p 


T 


T 




Fi{w) 


No 


DLAMN 


q = p = 


T 


T2 


Fi{w) 


FiH 


No 


LABF 


3fJ(p) = 


T 


T 


F2{w) 


F2{w) 


No 


LABF 


= A > 






Many 


Many 


Yes 


LAMN family 



There are obvious similarities between CAR(l) and CAR(2) in the ergodic case 
and, perhaps less obvious, similarities in the neutrally stable case (6' = in CAR(l) 
compared to p = yf—lu in CAR(2)) and in the exponentially unstable cases (6' > in 
CAR(l) compared to real p, g > in CAR(2)). In both CAR(l) and CAR(2), the rate 
Vt corresponds to normal distribution in the limit, exponential rate corresponds to 
a Cauchy-type distribution, and any rate polynomial in T leads to some functional of 
the standard Brownian motion. In the case of the positive double root {p = q > 0) the 
rate is slightly slower than exponential, but the limit distribution is still of the Cauchy 
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type. Also of interest are (a) several appearances of DLAMN instead of LAMN, (b) an 
unusual combination of asymptotic normality of the estimators and DLAMN (rather 
than LAN) of ix when g < < p, (c) relative compactness rather than convergence 
in distribution for both the estimators and the normalized log-likelihood ratio when 
p = \ + a/^z/, a > 0. 

It is instructive to compare CAR(2) with p = A + -\/— Tz^, A > 0, > 0, and the 
example considered in [161 Section 4.1]: 

While the eigenvalues of the matrix in fl2.23p are also A ± a/— Iz/, the special structure 
of the model ensures that the normalized local log-likelihood ratio is LAMN and the 
MLEs of A and z/, when normalized by a/2 Ae'^-^, converge to a joint limit (which, for 
zero initial conditions, is the bivariate t2-distribution). 

3. Asymptotic properties of the MLE and the normalized 

log-likelihood ratio 

Strong consistency of (12. 4p is a consequence of a more general result by Basak and 
Lee [2] . For the sake of completeness, here are the statement and the proof. 

Theorem 3.1. With probability one, lim-r^oo ^i,t = (ind limj-^oo ^2,t = 62 for every 
Proof. With Qt defined in (11. 5p . we find using ( 11. ip that 




The statement of the theorem now follows from [21 Theorem 2.1 and Remark 3.1]. □ 

Next, we present the theorem describing the limit distributions of 6i^t and 62^t- The 
proofs are in Section [5l To keep visual track of the formulas, it is convenient to think 
of the process X in ( II. ip as a dimensionless quantity and to measure t in the units [t] of 
time. Table 3 summarizes the resulting dimensions of all the variables and parameters 
in the problem. 

Table 3. Dimensions in CAR(2) 

Quantity t X{t) W{t) X(t), 9i, ^ij, P, q (r X{t), 62, 

Units [t] None [tY'^ [t]-^ [t]-^/^ [t]-^ 



One caveat: the auxiliary Brownian motion w{s), < s < 1, and its parameter s, 
as well as all random variables appearing in the limit distributions, are dimensionless. 
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Theorem 3.2 (Rate of convergence and limit distributions). 

I. Ergodic case. Assume that 9i < and 62 < 0, and let rii,ri2 be iid standard 
normal random variables. Then 



(3.1) lim VT|^(^i,t - ^1) = V2\e,\ r],, lim y^T\e^\{92,T - ^2) = VW^W^il 



V2- 



For the rest of the theorem, denote by p and q the roots of equation (12.221) . 

II. NON-ERGODIC CASE: DISTINCT REAL ROOTS. Let^,r] be Hd standard normal 
random variables, and let w = w{s), < s < 1, be a standard Brownian motion 
independent ofrj. 

(a) If p > and q < 0, then 

(3.2) lim {Oi,T - Oi) = -- lim (^2,t - ^2) = ^2 

T^oo p T^oo 

(b) Ifp>q> 0, then 

(3.3) lim e'^^iA,r - ^1) = -- lim (^,,^ - 62) ^ ^^^^ 

T^oo P T^oo p - q ^ + c 

where c= {X{0) - pX{0)) /a. 

(c) If p = and q < 0, then 61 = q, 62 = 0, and 

(3.4) lim v^(^i,T - ^1) = V2\e^\ V, lim m,T = "^^1'] • 

2 /q w^{s)ds 

(d) If p > and q = 0. Then Oi = p, 62 = 0, and 

(3.5) lim OiT (Oi ^ - ^i) = - lim T^2 t = Oi ^ ~ ^ . 

III. NON-ERGODIC CASE: A DOUBLE ROOT. 
(a) If p = q > 0, then 9i = 2q, 62 = , and 

(3.6) lim — (ei,T - ^1) = — lim — (e2,T - ^2) = 4\/2g- ' 



T^ocqT'^'' qr^ooqT'^'' ^' i^c 

where c = ^ (X(0) - pX(0)) ja and ^, r\ are iid standard normal random variables. 
(b) Ifp = g = 0, then 61 = 0, 02 = 0, and 

d 233^(1)-!) -23^(^l)3l- 32) 

lim TOiT = ^ 1 

(^j. ' 43233 

2^ . 432(«;(l)3i-32)-3K«^'(l)-l) 



lim T ^2 T 



where w = w{s), < s < 1, is a standard Brownian motion, and 

3i = / w{s)ds, 32 = / w'^{s)ds, 33 = / I / ty(s)ds I cit. 



\^o 



IV. NON-ERGODIC CASE: COMPLEX ROOTS. 
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(a) If p = y/^iy, u > 0, then 6i = 62 = —v^, and 

2 - wf{l) - wlil) 



(3.8) 



lim T9i T ^ 1 1 , 

^-00 ' wf{t)dt + j^' wim 



lim ne^^r - e,) ^ -1 W^-^(^) - /o Mt)dw,(t) 



where wi,W2 are independent standard Brownian motions. 

(b) Ifp = A+i/^z/, A > 0, u > 0, then, fori = 1,2, the families {e^'^ {Oi^T-Oi), T > 
0} are relatively compact, and all the limit distributions are the form 

where (CcCs) O'^d {f]c,Vs) ore independent bivariate normal vectors, Ef/c = Ef/^ = 0, 
and the mean values of ond S,s depend on the initial conditions X(0),X(0). 

One general conclusion of Theorem 13.21 is that, if p > and q < p, then it is the 
value of the smaller root q that determines asymptotic behavior of the estimators. This 
result comes as a surprise: the asymptotic behavior of both estimators is dictated by 
the non-dominant mode, even though this mode is "invisible" with probability one. 
Indeed, the solution of ( II. ip is a Gaussian process 

(3.9) X{t)=X{0)xi{t)+X{0)x2{t) + a [ X2{t - s)dW{s), 

Jo 

where the functions xi{t),X2{t) form the fundamental system of solutions for the equa- 
tion 

(3.10) x{t) - 9ix{t) - e2x{t) = 0. 

In other words, xi(0) = l,i:i(0) = 0, ^2(0) = 0, X2(0) = 1, and both xi = xi(t) and 
X2 = X2(t) satisfy (13.101) . The roots of the characteristic equation (12.221) are 

(3.11) p= ^ , q- 



2 ^ 2 

Then, with the usual modifications for complex, p, q, 

(qeP'-pe"' f e^* - e?* 

(3.12) xi{t) = i q-p X2{t} = < p-q 

[ (1 — qt)e'^^, if p = g; [te'^*, if p = g. 

When the roots (13. lip are real and distinct, equation (I3.10p has two Lyapunov expo- 
nents, p and q, and it follows from (13. 9p that if g < p, then 



P lim -ln|X(t)| = g = P lim -ln|X(t)| = g = 

\t^oo t J \t^oo t J 

for all initial conditions X{0), X{0). Thus, Theorem 13.21 shows that if the larger 
Lyapunov exponent of (I3.10p is positive, then the asymptotic behavior of the estimators 
is determined by the smaller Lyapunov exponent. 
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As another illustration of the effects of the exponentially unstable mode, note that 
if 6^2 = 0, then f ll.ip becomes 

(3.13) dX = 9iX + (TW. 

In other words, X is a CAR(l) process. If 6i < 0, then asymptotic behavior of Oi^t 
and ^2,T and similar to the CAR(l) situation in ergodic and neutrally stable cases, 
respectively. If ^2 > 0, then there is no clear similarity with CAR(l). 

When p > 0, the normalized limits of 6'i.r — 9i and ^2,r — ^2 are negative multiples of 
each other. Some correlation is also present when p = \ + -\/— Tz/, A > 0, > 0. This 
type of correlation in non-ergodic multi-parameter models has been observed before; 
see, for example, [121 Section 4.1]. Still, as the case g < = p shows, lack of ergodicity 
does not necessarily imply correlation of the limits. 

One can verify qualitative consistency of the results of Theorem 13.21 by considering 
various limiting regimes for p and q. For example, passing to limit p \ g in (13. 3p 
suggests that the rate in the case of the positive double root should be slower than 
exponential. 

Next, we study the possibility of NLRR, that is, existence of a random matrix 
R = R(T) such that R(T){dT — 0) converges in distribution to a bivariate normal 
vector. 

Theorem 3.3 (Normal Limit with a Random Rate (NLRR)). Denote by p and q the 
roots of equation (I2.22p . Normal limit with a random rate is possible in the following 
six cases: ergodic; p>q>0;p = q>0;q<0<p;q<0 = p (for 9i^t only); 
p = \ + \f-\v, A > 0, > 0. 

Ergodic case. Assume that Q\ < 0, ^2 < 0, and let r]i,ri2 be iid standard normal 
random variables. Then 

lim 1 / X\t)dt] {eiT-Oi) = (Jr]i, 

(3.14) ' ' 



^lim (^^ X\t)dt^ (e2,T-02) = cTr]2. 



T \ 1/2 

f / 

T-s>oo 

For the rest of the theorem, denote by t] a standard normal random variable. 
District Positive Roots. Assume that p > q > and define 

T \ 1/2 



(3.15) ""^^^^ (/ {^it)-pXit))'dt^ 
Then 

(3.16) lim r(T)(9iT-0i) = lim r(T)(^2T-^2) = ^^^crr]. 

T-i-oo ' ' ' ' p T-i-oo ' ' ' • P~ q 

Positive Double Root. Assume that p = q > and define 

r-T \ 1/2 



r{T) = ^2(1 
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Then 

(3.17) lim riTMdiT- Oi) = -- lim r(T)(^2T - ^2) = 2V2par]. 

T->oo ' p T-i>oo ' 

Roots of opposite sign. Assume that q < < p and let r{T) he as in fl3.15p . 
Then 

(3.18) lim r(T)(^ir-^i) = -- lim r(T) (^2 t - ^2) = cr 77. 

T^oo ' p T-5-00 ' 

Zero root. Assume that g < 0, p = 0, and define 

r 



^(T) = / X\t)dt. 

Jo 



Then 

2 

(3.19) lim r{T){9,,T - 9^) = r/. 

Complex roots. Assume that p = X + \f—\v, A > 0, z/ > and define matrices 

Then there exists a bivariate normal vector (Us,Uc) such that the family 

B{us,Uc)AT'^T{eT-0), T>0, 

is relatively compact, and all partial limits are bivariate normal random vectors inde- 
pendent of (Us, Uc). 

Theorem 13.31 suggests two negative conclusions: (a) \E' is usually not the correct 
random normalization (which is especially striking when p = X + a/ —lu); (b) existence 
of NLRR in non-ergodic CAR(2) is not as helpful as in CAR(l), because the rates 
and/or the limit distributions depend on the unknown parameters. 

Finally, we describe the asymptotic structure of the normalized log-likelihood ratio 

(EUD. 

Theorem 3.4 (The structure of ir)- Denote by p and q the roots of equation fl2.22p . 
by diag(a;, y) the diagonal matrix with x and y on the main diagonal, and by bp the 
column vector 

If 01 < 0, ^2 < (ergodic case), and At = diag(T-i/^ T^^/^)^ ^/^g^ £^ i^^^ 

If p > 0, q < p, and At = e~'P'^bpbJ , then It is degenerate LAMN and the matrix 
B m (I27[4l) IS 

(3.20) B=^-^^^a\^bpbJ, 
where ( is a standard normal random variable. 
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If q <{) = p and At = diag(T-^ T^^/^) ^/^g^ £^ ^^^^g^ LABF/LAN: 



13 



(w) = — Bu, 



where 



^ (2|g|)-VVr/ ' V cr\2\q\)-^ ) ' 

7] is a standard normal random variable, w is a standard Brownian motion, and rj and 
w are independent. 

If P = Q > ^ O'lT'd At = T~^e~'^^bpbJ , then £t is degenerate LAMN and the matrix 
B m ^a:Wf IS given by f lCTj) . 

If P = Q = ^ O'lT'd At = dia.g{T~'^,T~^), then It is LABF and the matrix G{t) in 
IS 



Git) 



a Jq w{s)ds 

aw{t) ) ' 



where w is a standard Brownian motions. 

If p = \f—\v, z/ > 0, and At = disig{T^^,T^^), then It is LABF and the matrix 
G{t) m f l2T5|) %s 



awi{t) aw2(t) 
-aw2{t) awiit) 



where Wi and W2 are independent standard Brownian motions. 
Ifp = \ + \f^^v. A, z/ > 0, and 

then {Et, T > 0} is a relatively compact LAMN family having all partial limits of the 
form fl?l4D . 

One general conclusion of Theorem 13.41 is that, if p > and q < p, then it is 
the value of the larger root p that determines asymptotic behavior of the normalized 
log-likelihood ratio, which is in sharp contrast with Theorem 13.21 
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4. Preparation for the proofs 

To study asymptotic behavior of 9i^t and 6'2,t, we need the expressions for the 
residuals 9i^T — Oi, i = 1,2: 
(4.1) 

[f,^X\t)dt) (f;^X{t)adW{t)) - (f^X{t)X{t)dt) (f^ X{t)adW{t) 

dl,T — dl = 



72,T 



/;X2(t)rft) (/;x2(t)rft) - (/;x(t)x(t)dt 

J^X\t)dt) fj;;^X{t)adW{t)) - (j^X{t)X{t)dt) fj;^X{t)adW{t) 



j.^x^m) iioX^m) - (/; xmm 



These equahties foUow directly from (12. 4p and (II. ip . 

Equation (II. ip has a closed-form solution (13. 9p . meaning that (14. ip can be written in 
terms of integrals of the type f {s)dW {s) . Unfortunately, this direct approach quickly 
leads to intractable expressions. Therefore, despite availability of explicit formulas, 
a more sophisticated approach to the analysis of (14.11) is necessary. In the ergodic 
case, the ergodic theorem provides all the necessary tools, and when 6i = 62 = 0, the 
expressions are simplified using self-similarity of the standard Brownian motion. Other 
cases benefit from the following construction. 

Given two square-integrable on [0,T] functions f,g, define 



T \ / rT 

2/ 



(4.2) N{T;f,g)=[ f\t)dt [ g{t)adWit) 



T 



f{t)g{t)dt / f{t)adW{t) 



(4.3) D{T- f, g) = f{t)dt^ g'it)d?j - f{t)g{t)dt 

Clearly, D{T] /, g) = D{T; g, /), but in general N{T; /, g) 7^ N{T; g, /). Then formulas 
iH) become 



n n N{T;X,X) - n N{T;X,X) 

(4.4) C'l T — D'l = : — , UOT — C'o = : — . 

^ ^ ' D{T;X,Xy ' D{T-X,X) 

The general idea of the proof of Theorem 13.21 is to find the asymptotic behavior of 
D{T;X,X), N{T;X,X), and N{T;X,X), as T cx). To keep track of the results, 
note that, according to Table 3, D{T; X, X) is dimensionless, N(T; X, X) is measured 
in [t]~^, and N{T] X , X) is measured in 

For every real numbers a, /3, 7, k and every square-integrable functions f,g, we have 
the following identities: 

(4.5) D(T; af + f3g, 7/ + ^g) = {aK - f3jfD{T; f, g), 

(4.6) N{T; af + /3g, 7/ + Kg) = {a\ - aP^)N{T; /, g) + (/?% - a/3K)N{T; g, /). 

Recall that Oa.s.(i) denotes a continuous random process converging to zero with 
probability one as t — )■ 00. We will often use the following result: if f{t) > is a 



ESTIMATION IN LINEAR CAR(2) 15 

continuous process, F{T) = f{s)ds, and liuiT^oo F{T) = oo with probability one, 
then 

(4.7) -^^^/(t)Oa.s.(t)rft = Oa.s.(r). 

Indeed, (14 .Tp is immediate if the integral /q°° /(t) Oa.s.(^)c?t converges; otherwise, (14 .Tp 
follows after one application of L 'Hospital's rule. 

Here are some other technical results to be used later. For r > 0, define Gaussian 
random variables 

/•oo i*T 

(4.8) ^r= e-'-'dWis), ri,{T) = / e-''^'^-'Uw{s). 

Jo Jo 

We have Er/^(T) = 0, Er/2(T) = (1 - e-2^^)/(2r), and therefore 

(4.9) hm Vr{T)=7^r, 

where the random variable rjr is normal with mean zero and variance l/(2r). Since 

(4.10) MqVr{T) = e-'^ I e^'-'i^'ds ^0, T -> oo, 

Jo 

it follows that rjr and are independent for every g, r > 0. 

A more sophisticated version of the above observations is the following result. 

Proposition 4.1. Let M = {Mi{t) , . . . , Md{t)) , < t < 1, be a d-dimensional con- 
tinuous Gaussian martingale with M(0) = 0, and let = (Mx^iit), . . . , Mx^dit)), 
T > 0, < t < 1, be a family of continuous square-integrable d-dimensional martin- 
gales such that Mr(0) = for all T and, for every t G [0, 1] and i,j = l,...,d, 

lim {MT,^,MT,J){t) = {M,,M,){t) 

T— ^oo 

in probability. Then limT_!.oo Mt = M in the topology of continuous functions on [0, 1]. 

Proof. Modulo a non-essential (in this case) difference between a sequence and a family 
indexed by the positive reals, this is a particular case of Theorem VIII. 3. 11 in [8J. □ 

For every r > 0, the process rir{t), t > 0, is ergodic (in fact, strictly mixing). 
Therefore, the ergodic theorem implies 

(4.11) Vr{t)dt = 0,,,XT), r^^^^t)dt = ^ + 0,,,XT), 

and, together with Proposition 14.11 

1 



(4.12) 



hm -j= rVr{t)dW{t)^rj^, 

T^oo y/T Jo 

/ rT \ -1/2 

^lim n Vr{t)dt] J Tir{t)dW{t) = V2^ r]i 



where r]^ is normal with mean zero and variance l/(2r), and the random variables 
(C?) Vt) Vr) joiiitly independent for every q,r > 0. 
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Asymptotic analysis of certain stochastic integrals can benefit from the law of iter- 
ated logarithm. Recall that if / is locally square-integrable adapted process, M{t) = 
Jq f{s)dW{s) and {M){t) = p{s)ds +00, t — )■ 00, with probability one, then 
there exists a standard Brownian motion W such that 

M(t) = iy((M)(t)). 

The law of iterated logarithm for W implies 

(4.13) lim ^ ^^^ = Oas (T) 

for every e > 0. For example, if e > 0, then 

(4.14) T-'yir{T) = Oa.s.(T). 

To conclude the general discussion we establish an integration by parts formula. As 
a motivation, recall that analysis of CAR(l) in the exponentially unstable case leads 
to the function Vp{t) = e^^*~*^(il^(s), p > 0, for which integration by parts shows 
that 

(4.15) e-^^ Vpit)dWit) - eP%r/p(T)^ = Oa.s.(r). 

Since exponentially unstable solutions of equation f l3.10p are of the form e^*/(t), where 
the function / grows at most polynomially, we generalize (14.151) as follows. 

Proposition 4.2. Given a deterministic (for simplicity) and locally square-integrable 
function f, define Sf{t) = f^^ f{s)dW{s). 
Let functions ip and ip be such that 



(4.16) 



r+00 

/ tp\t)dt = +00, eP''\^{T)\+e-P^\tP{T)\<C{l + T-), T>0, 
Jo 



for some p,C,r > 0. Then 



(4.17) ^''"^(Z S^{t)dS^{t)-S^{J)S^{T)^=^^.,{T) for all g > 0. 
Proof. By the Ito formula, 

(4.18) S^{T)S^{T)- f S^{t)dS^{t)= ! ^{t)iP{t)dt+ I S4t)dS^{t), 



and it follows from fl4.16p that 

e""^ [ \ifi{t)ij{t)\dt < Cie-'?^(1 + T"+^) -> 0, T -> 00. 
Jo 

To estimate the second term on the right-hand side of (14.181) . recall that, if M = M{t) 
is a continuous square-integrable martingale, then, by the strong law of large numbers, 
a finite limit 

M(T) 
T^l+ (M)(T) 

exists with probability one ([HI Corollary 2 to Theorem 2.6.10]). As a result, if F = 
F{t) is a function such that F{T){M){T) = Oa.s.(T), then F{T)M{T) = Oa.s.(T). 
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Next, consider 

N{t)= fs4s)dS^{s)= f S4sMs)dW{s) with {N){t)= f Sl{s)^\s)ds. 
Jo Jo Jo 

We need to show that e-'^^{N){T) = Oa.s.(T). By (I416D . 

(4.19) {S^){t) = ti;\s)ds < C,e'P\l + f n, 

Jo 

and therefore 

(4.20) hm e-'''{S^){t)if\t) < lim C^e-'^'il + t^') = 0, g > 0. 

t—^oo t—^oo 

By assumption, sup^{S^){t) = oo. We then use fl4.13p and fl4.19p to conclude that 

lim , =0, £ > 0. 

T^oo{S^){T)T- 

Therefore, 

^^j-SlmAT) = ^.-.J'e-'-(S,m'AT)jg^, = 0, 

which, by L'Hospital's rule, implies limr^oo e~^^(A^)(T) = 0a.s.(2^)- 

This completes the proof of Proposition \4.2\ □ 

We will use Proposition 14.21 with q = p to simplify various stochastic integrals. As 
a quick illustration, let us verify fl4.15p . Take ip(t) = e~^*, ipit) = e^*- Then, together 
with (Oil , equality f l4.17p imphes 



-pT 



\J0 



P^'-'^dW{s)^ dW{t) = (^J^ e-P^^-'UW{t)^ (^J^ e-P'dW{t) 
+Oa.s.(T) =r/p(r)ep + Oa.s.(T). 



5. Proofs of Theorems I3.2H3.4I 
Proof of Theorem 13:21 

I. The ergodic case. See P Remark 2 after Theorem 4.6.2]. 

II. NON-ERGODIC CASE: DISTINCT REAL ROOTS. U p ^ q, then (13.90 and fl3.12p 
imply 

(5.1) X{t) = Vp{t)-V,{t), X{t)=pVp{t)-qVg{t), 

where 

p-q p-qJo 
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By (USD and fOjl 

D{T;X,X) = {p- qfD{T;Vp,V,), 
(5.3) N{T; X, X) = (p - q) (iV(T, V^, V^) + N{T; V„ V^)) , 

iV(T; X, X) = -(p - q) (pN{T, Vp, V,) + gX(T; V„ V,)) . 

To complete the proof, it now remains to use the specific expressions for the functions 
Vp and Vq, which we do next. 

11(a). Roots of opposite sign: g < < p. Using notations (Ojl . define 
Gaussian random variable (p by 

_ A-(0)-A-(0), ^ ^ ^ M^A(% ^ ^ 

p-g p-q Jo P-Q P-Q 

and note that 

^pW = e^'iCp + Oa.s.(t)), = -^^,m + ^W--^W^ e^^ 

V ^ p — q ' ' P — q 

Then 

/ V;(i)nW*= / e"(Cr + 0„,(«))l/,(i)<ii = e''''(Cp>)|,|(r)+0„.(r)), 

^0 JO 

[ Vpit)dWit) = VTeP^O^.,.{T), lim ^ / 
Plugging the results into (15. 3p . 



= V\q\- 

p — q 



DriT;X,X)={Te^^^) (| + Oa.s.(T) ) , 



p-2pT 2 ^2 2pT 



Then both equalities in (13. 2 p follow from (14.41) . 

11(b). Distinct positive roots: < q < p. When p > q > 0, computations are 
very similar to the case q < < p. The difference comes from the fact that, for q > 0, 
we have 

V,W.e«.(^^, + M^^ + 0„.W 

\p — q P — Q 

Using Proposition 14.21 and notations 

^ _ X(0)-X(0)g , a ^ ^ _ X(0)-X(0)p , a ^ 
p — q P — Q P ~ Q P ~ Q 
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we find 

f5 4) D(T- X X)= 'i^3^lZ3}le^ip+<i)T 



lim e-(^+^-)^iV(T; V,^ V,) ^ ' ^) 

T^oo ^ ^ \2p p + qj 



lim e-^-^+^rtT^^y. y^^ y^^ £ Q 

T— ^oo 



It remains to observe that 

Vq Vp 



2p p + q 

is a Gaussian random variable, independent of (Cp, Cq), with mean zero and 
{p — qY / {8p'^q{p + qY). Then both equalities in (13. 3p follow from fl4.4p . 

11(c). Larger root is zero: q < p = 0. By (15. ip with p = 0, 

(5.5) X{t) = Vo{t) - Vq{t), X{t) = -qVg{t), 
and 

|g| |g| |g| |g| 

Using dm]), dug), and fOD . we find: 

(5.6) rn^(t)rft = ^(l + 0_(T)), 



(5.7) V^it)dt=^-^Q^ W\t)dt + OUT)y 

(5.8) / Vo{t)X{t)dt = T^N^,,.{T), 

Jo 

(5.9) to^-L£l.«)<„y«)i^,,x. 

(5. 10) £ U,{t)dW{t) = ^ C^'"'^^'^ + 0.... (T)) . 

Self-similarity of the standard Brownian motion implies 

(5.11) l!3Z^i!^H^. l^l^r-m^l^^i.),.. 

Combining (ISlGjl - dSTT]) with dES]) yields 

D{T;X,X) = ^i^— W\t)dt + Q,.,{T)y 

limT-/^iV(T;X,X)^^,i,/^ 

4 



w {s)ds, 



}imT-^N{T;X,X)^—{w\l)-l). 



Then both equalities in (13. 4 p follow from (14. 4p . 
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To show independence of 1] and w, take a standard Brownian motion w = 
t G [0, 1], that is independent of w and apply Proposition 14.11 with 

M{t) = {w{t), wit)), Mrit) = I dW{s), ^ yi\,\{s)dW{s) 

11(d). Smaller root is zero: g = < p. We have 
X{t) = Vp{t)-V^{t), X{t)=pV,{t), 

Voit) = Uoit) = ^(^)'^(^)P + lw{t), V,{t) = e^'iC + 0,.3.(t)), 

p p 



oo 



P P Jo 
Then 

T 2pT rT 2rp2 / T fT \ 

K"m = ^{Ci + o..s.{T)), ^^'^^^^^ = ^ [t-^ I W\t)dt + 0^.^.{T)j 







f Vo{t)Vp{t)dt = TeP'^0,.sXT), [ Vp{t)dW{t) = e^''{Cpr]p{T) + 0,,,XT))., 

^0 

U,{t)dW{t) = — 1^-^ + 0_(T) j . 



By (O, 



D{T-X,X) = ^^^I^Q- l^2(t)rft + 0_(r)j 
N{T- X, X) = ^Te^-^ (C,^ ^'^11'^ + Oa.s.(T)) 



and then both equahties in ( 13. 5p follow from ( 14. 4p . 

III(a). Positive double root: p = q> 0. With xi{t) = (l-gt)e'?*, X2{t) = te''\ 
(13. 9p becomes 



X{t) = X(0)(1 - gt)e«* + X(0)te«* + a [ [t - s)e''^'-'^dW{s), X{t) = qX{t) + Q{t), 

Jo 

Q{t) = (^X{0)-X{0)q + a e~'^'dW{s)^ e'^K 

r+oo 

( = X(0) - X{0)q + CT e-'^'dWis), 

Jo 

then 

x(t) = te^*(c + o,.,.(t)), g(t) = e«*(C + 0,.,.(t)). 



where 



If we define 
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By (USD, 

(5.12) D{T; X,X) = e^^^ + 0_(r)^ ; 

note that the same result follows after passing to the limit p \ g in fl5.4p . 
Next, define 



Jo Jo 

and observe that 

(5-13) ^^^T{r^,AT)-V,,2iT))=^, 

where r/ is a standard Gaussian random variable, independent of (. 
Therefore, 

^ X(t)Q(t)dt = Te2'?*f^ + 0a.s.(T)V ^ g(t)rfiy(t) = e'?^(Cr/,,i(T) + Oa.s.(T)), 

^ X{t)dW{t) = Te''^(c^,,2(T) + Oa.s.(r)). 

To continue, 

N{T;X,Q) = ^rV'^^(r/,,i(T) -r^,,2(r))(C=' + 0a.s.(T)), 

It remains to observe that 

NiT; X, X) = N{T; X, Q), Ar(T; X, X) = -qN{T- X, Q) + iV(T; Q, X). 

Then both equalities in (13 .Op follow from fl4.4p and f l5.13p . 

III(b). Zero double root: p = g = 0. In this case the result follows directly 

from (14. ip using self-similarity of the standard Brownian motion: W{T-) = y/Tw{-). 
Recall the notations 

3i = / w{s)ds, 32 = / w'^{s)ds, 33 = / I / w{s)ds J dt. 
Jo Jo Jo \Jo / 

With 9i = 02 = 0, equation (11. ip becomes dX = adW, and therefore 

X{t) = X{0) + aW{t), X(t) = X(0) +X(0)t + (T [ W{s)ds. 

Jo 
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Then 



D{T; X, X) ^ ^ (43233 - 3i + Oa.s. (T)) , 
N{T; X, X) ^ ^ (33 (^'(1) - 1) - 3? (^(1)31 " 32) + Oa.s. (T) 
X(T; X, X) ^ ^ (432 («;(l)3i - 32) -3l{w\l)-l)+ OUT)) , 

leading to ([32D- If X(0) = X(0) = 0, then equahties in ([321) hold for every T > 0. 

IV(a). Complex roots: p = z/ > 0. This case is the subject of [13]. 

Below we outline the main steps. 
By (O, 

Xif) = X(0) cos(z/t) + sin(z/t) + - / sm{u{t - s))dW{s), 

^ ^ Jo 

X(t) = -X(0)z/sin(z/t) +X(0)cos(z/t) + a / cos{u {t - s))dW{s). 



By Proposition 14. as T — 00, the pair 



VtJo 



pit cy pit 

/ sin(z/s)rfiy(s), ^ / sin(z/s)diy(s), t e [0, 1] 



converges in distribution to a two-dimensional standard Brownian motion (^1,^2). 
Then 

hm f X\t)dt^f-^ [\wl{s) + wl{s))ds, 

hm rx\t)dt^^ [\wl{s)+wl{s))ds, 

i / X(t)diy(t) ^ ^ / (u7i(s)c/u;2(s) - ti;2(s)rfti;i(s)), 
Jo Jo 



lim — 

T^oo T 

lim — 

T^oo T 



W Xit)dWit) = ^[ {wi{s)dwiis) + W2is)dw2{s)). 

Finally, f gl3|l implies X2(T) = T^Oa.s.lT). Then both equalities in ([31]) follow from 
fliT|) . For details, see pj] Sections 2-4]. 

IV(b). Complex roots: p = \ + > 0, > 0. The fundamental system 

of solutions of (13.101) in this case is 

Xi{t) = — (z/cos z/t — A sin ut) , X2{t) = — e'^* sin ut. 
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Therefore, 

(5.14) X{t) = ^ ((X(0) - X(O)A) sin ut + X(0)z/ cos z/t) 

+ ^ /" e^(*^'^)sinz/(t-s)diy(s), 
^ Jo 

(5.15) X(t) = \X{t) + uY{t), 

At 

(5.16) Y{t) = — ((X(0) - X(O)A) cos ut - X(0)z/ sin z/t) 



V 

+ ^ f e^^'-'^ cos u{t - s)dW{s). 

y .In 



Define Gaussian random variables 

X(0)-X(0)A a 



+ - / e'^^ COS iytdW{t), 
^ Jo 



(5.17) 

u, = -X(0) + - / e"^* sin ut dW{t) 
^ Jo 

and the functions 

V^{t) = e^* cos ut, Vsit) = e^* sin i^t. 

Then 

(5.18) X{t) = u,Vs{t) - UsV,{t) + e^%,,Xt), Y{t) = u,K(t) + u,V,{t) + e^%,,Xt). 
By m, 

4AT 4 

(5.19) ^(T; X, X) = ((u^ + + Oa.s.(T)) . 
Next, define Gaussian random variables 

(5.20) ^^{T)= f e^^'-^^ cos vtdW{t), \)s{T) = I e^^'-^'^ smutdW{t). 

Jo Jo 

For each T > 0, the vector (f)c(T), \)s{T)) is bivariate normal with zero mean, 

(5.21) al{T) = ¥.\)1{T) = ^ (l + yp=f cos(2z/T - 0)^ + o(l), 

(5.22) a,2(r) = E[)^(T) = ^ (l - yp=f cos(2z/T - 0)^ + o(l), 

(5.23) a,,(T) = ¥.UT)UT) = , J ^ sin(2z.T - 0) + o(l), 

where cos0 = A/VA^ + z/^, sin0 = z//VA^ + z^^, and o(l) denotes a non-random func- 
tion e = e(T) such that limr-i-oo e(r) = 0. Thus, the family {[)c(T), P)s(T), T > 0} is 
relatively compact, with limit points being bivariate Gaussian vectors (f}c, f)s)- Com- 
putations similar to (I4.10p show that each vector (f)c, ()s) is independent of (uc, u<j). 
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On the other hand, 

NiT; Vs) = ae'^^ {al{T)UT) - crMT)), 
iV(T; = ae=^^^((T,'(T)[)e(T) - (T,,f),(r)). 

To complete the proof, it remains to express N{T\ X, X) and N{T] X, X) in terms of 
X(T; K, K) and X(T; K, K) using (gS]) and Proposition 
r/izs completes the proof of Theorem \3.2i 

Proof of Theorem 13.31 The proof is straightforward analysis of the computations 
in the proof of Theorem 13.21 with the goal to find suitable normalization leading to the 
Gaussian limit. Let us illustrate this in the most interesting case, when p = X + 
and the corresponding rate matrix Rt is not diagonal. In this case, f l5.14p -f l5.18p and 
fl5.20p . together with Proposition 14. 2[ imply 



(5.25) 



J^x{s)dW{s) \ ^ f -1 o \ ( Us -uc W + Oa.s.(r) 

J^X{s)dWis) J u \-X u J \uc Us J \ UT) + 0..s.{T) 



[B[Us,u,)) y [3^(T) + 0a.,.(T) 
On the other hand, according to (14. ip . 




j^X{s)dW{s) 
^ I j^X{s)dW{s) 



That is. 



B(u..ujA.*.(§.-.) = (wp, + °;;;-[^)), 

This completes the proof of Theorem \3.3\. 

Proof of Theorem 13. 4i The proof is straightforward analysis of the computations 
in the proof of Theorem 13.21 this time with an emphasis on the asymptotic behavior 
of the matrix ^!np and the vector X(t)dW{t). The analysis is easy in the ergodic 
case and also when p = or p = yj—lv. When p = A + a/— Iz/, the proof is essentially 
complete after f l5.25p . 

Here are the results when p > 0. If g < p, then 

j\{t)dW{t) = e^'^Cp(^p(T) + Oa.jT))bp, = ^e^P^(bpbJ + 0,,,XT)y 
If q = p, then 

j\{t)dW{t)=Te^^C{np,2iT) + 0..s.{T))bp, v]/^ = ^ TV^^(6p + O^.JT)) . 
Note also that 

{b,b;Y = {i+p')b,bj. 



This completes the proof of Theorem \3.4 
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6. Summary and Discussion 

The maximum likelihood estimator (12 .17^ in CAR(l), that is, the one-dimensional 
OU process fl2.16p . has three types of asymptotic regimes, depending on the sign of the 
parameter 9. For CAR(2), that is, the second-order equation fll.ip with two unknown 
parameters, while the MLE still has the same form and is strongly consistent for all 
values of the parameters, the number of different asymptotic regimes is nine. This jump 
in complexity underlines the challenges related to the analysis of the general estimation 
problem, either for the iV-th order linear equation (CAR(iV)) or an A^-by-A^ system. 

If equation (I2.22P has real roots and one of them is positive, then the following two 
features seem to be common: 

(1) The rate of convergence of the estimators is determined by the smaller root of 
equation (12:221) : 

(2) The asymptotic behavior of the normalized log-likelihood ratio is determined 
by the larger root of equation f|2.22p . 

It is interesting that the problem is mich easier for homogeneous equations, that is, 
mult i- dimensional analogues of the geometric Brownian motion. In those models, esti- 
mation of the drift matrix in any number of dimensions leads to the LAN situation as 
long as the diffusion matrix is non-degenerate; see [9] for details. 

The finite-difference equation arising from discretization of fll.ip presents other chal- 
lenges. In particular, the noise sequence driving the equation is no longer independent; 
see [121 Section 5] for details. While these difficulties can be resolved in the ergodic 
case ([!]), the general case remains unsolved. 
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